Learning in Computer Vision and Image Understanding
نویسنده
چکیده
Using learning in segmentation or recognition tasks has several advantages over classical model-based techniques. These include adaptivity to noise and changing environments, as well as in many cases, a simplified system generation procedure. Yet, learning from examples introduces a new challenge getting a representative data set of examples from which to learn. Applications of learning systems to practical problems have shown that the performance of the system is often critically dependent on both the size and quality of the training set. Federico Girosi of MIT suggested the use of prior information as a general method for synthesizing many training examples from few exemplars. Prototypical transformations are used for general 3D object recognition. Face-recognition was presented as a particular example. Dean Pomerleau of Carnegie Mellon addressed the training data problem as well, within the context of ALVINN, a neural network vision system which drives an autonomous van without human intervention. Some general problems emerge, such as getting sufficient training data for the more unexpected scenes including passing cars and intersections. Several techniques for exploiting prior geometric knowledge during training and testing of the neural-network, were presented. A somewhat different perspective was presented by Bartlett Mel of Caltech. Bartlett introduced a 3D object recognition approach based on concepts from the human visual system. Here the assumption is that a large database of examples exists, with varying viewing angles and distances, as is available to human observers as they manipulate and inspect common objects.
منابع مشابه
Cluster-Based Image Segmentation Using Fuzzy Markov Random Field
Image segmentation is an important task in image processing and computer vision which attract many researchers attention. There are a couple of information sets pixels in an image: statistical and structural information which refer to the feature value of pixel data and local correlation of pixel data, respectively. Markov random field (MRF) is a tool for modeling statistical and structural inf...
متن کاملSimilarity measurement for describe user images in social media
Online social networks like Instagram are places for communication. Also, these media produce rich metadata which are useful for further analysis in many fields including health and cognitive science. Many researchers are using these metadata like hashtags, images, etc. to detect patterns of user activities. However, there are several serious ambiguities like how much reliable are these informa...
متن کاملارزیابی یک سیستم بینایی ماشین از راه اندازهگیری و تخمین شماری از ویژگیهای فیزیکی پسته
In order to increase the role of machine vision in agricultural research in Iran, especially for measuring physical attributes of seeds, a machine vision system was developed using a computer, a capture card, a video camera and a light box. All equipment was purchased from domestic markets. Computer programs were developed for hardware setup and for image processing applications. The programs p...
متن کاملارزیابی یک سیستم بینایی ماشین از راه اندازهگیری و تخمین شماری از ویژگیهای فیزیکی پسته
In order to increase the role of machine vision in agricultural research in Iran, especially for measuring physical attributes of seeds, a machine vision system was developed using a computer, a capture card, a video camera and a light box. All equipment was purchased from domestic markets. Computer programs were developed for hardware setup and for image processing applications. The programs p...
متن کاملTwo New Methods of Boundary Correction for Classifying Textural Images
With the growth of technology, supervising systems are increasingly replacing humans in military, transportation, medical, spatial, and other industries. Among these systems are machine vision systems which are based on image processing and analysis. One of the important tasks of image processing is classification of images into desirable categories for the identification of objects or their sp...
متن کاملFacial Expression Recognition Based on Structural Changes in Facial Skin
Facial expressions are the most powerful and direct means of presenting human emotions and feelings and offer a window into a persons’ state of mind. In recent years, the study of facial expression and recognition has gained prominence; as industry and services are keen on expanding on the potential advantages of facial recognition technology. As machine vision and artificial intelligence advan...
متن کامل